TECH CENTER 1600/2900 



OIPE 



RAW SEQUENCE LISTING DATE: 06/19/2001 

PATENT APPLICATION: US/09/525,041 TIME: 12:33:29 

Input Set : A:\PF178D2_SubSeqList.txt 

Output Set: N:\CRF3\06192001\I525041.raw 

3 <110> APPLICANT: Soppet et al. 

5 <120> TITLE OF INVENTION: Colon Specific Gene and Protein 

7 <130> FILE REFERENCE: PF178D2 

9 <140> CURRENT APPLICATION NUMBER: US 09/525,041 

10 <141> CURRENT FILING DATE: 2001-06-04 

12 <150> PRIOR APPLICATION NUMBER: US 09/162,508 

13 <151> PRIOR FILING DATE: 1998-09-29 

15 <150> PRIOR APPLICATION NUMBER: US 08/468,413 

16 <151> PRIOR FILING DATE: 1995-06-06 
18 <160> NUMBER OF SEQ ID NOS: 6 

20 <170> SOFTWARE: PatentIn version 3.0 

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 1114 

24 <212> TYPE: DNA 

25 <213> ORGANISM: Homo sapiens 

27 <220> FEATURE: 

28 <221> NAME/KEY: CDS 

29 <222> LOCATION: (111)..(587) 

31 <400> SEQUENCE: 1 

32 gcacgaggcc aaacagattt gcagatcaag gagaacccag gagtttcaaa gaagcgctag 60 

34 taaggtctct gagatccttg cactagctac atcctcaggg taggaggaag atg gct    116
35                                                        Met Ala
36                                                        1

38 tcc aga agc atg cgg ctg ctc cta ttg ctg agc tgc ctg gcc aaa aca    164
39 Ser Arg Ser Met Arg Leu Leu Leu Leu Leu Ser Cys Leu Ala Lys Thr
40         5                   10                  15

42 gga gtc ctg ggt gat atc atc atg aga ccc agc tgt gct cct gga tgg    212
43 Gly Val Leu Gly Asp Ile Ile Met Arg Pro Ser Cys Ala Pro Gly Trp
44     20                  25                  30

46 ttt tac cac aag tcc aat tgc tat ggt tac ttc agg aag ctg agg aac    260
47 Phe Tyr His Lys Ser Asn Cys Tyr Gly Tyr Phe Arg Lys Leu Arg Asn
48 35                  40                  45                  50

50 tgg tct gat gcc gag ctc gag tgt cag tct tac gga aac gga gcc cac    308
51 Trp Ser Asp Ala Glu Leu Glu Cys Gln Ser Tyr Gly Asn Gly Ala His
52                 55                  60                  65

54 ctg gca tct atc ctg agt tta aag gaa gcc agc acc ata gca gag tac    356
55 Leu Ala Ser Ile Leu Ser Leu Lys Glu Ala Ser Thr Ile Ala Glu Tyr
56             70                  75                  80

58 ata agt ggc tat cag aga agc cag ccg ata tgg att ggc ctg cac gac    404
59 Ile Ser Gly Tyr Gln Arg Ser Gln Pro Ile Trp Ile Gly Leu His Asp
60         85                  90                  95

62 cca cag aag agg cag cag tgg cag tgg att gat ggg gcc atg tat ctg    452
63 Pro Gln Lys Arg Gln Gln Trp Gln Trp Ile Asp Gly Ala Met Tyr Leu
64     100                 105                 110

66 tac aga tcc tgg tct ggc aag tcc atg ggt ggg aac aag cac tgt gct    500
67 Tyr Arg Ser Trp Ser Gly Lys Ser Met Gly Gly Asn Lys His Cys Ala
68 115                 120                 125                 130

70 gag atg agc tcc aat aac aac ttt tta act tgg agc agc aac gaa tgc    548
71 Glu Met Ser Ser Asn Asn Asn Phe Leu Thr Trp Ser Ser Asn Glu Cys
72                 135                 140                 145

74 aac aag cgc caa cac ttc ctg tgc aag tac cga cca tag agcaagaatc     597
75 Asn Lys Arg Gln His Phe Leu Cys Lys Tyr Arg Pro
76             150                 155

78 aagattctgc taactcctgc acagccccgt cctcttcctt tctgctagcc tggctaaatc  657
80 tgctcattat ttcagagggg aaacctagca aactaagagt gataagggcc ctactacact  717
82 ggctttttta ggcttagaga cagaaacttt agcattggcc cagtagtggc ttctagctct  777
84 aaatgtttgc cccgccatcc ctttccacag tatccttctt ccctcctccc ctgtctctgg  837
86 ctgtctcgag cagtctagaa gagtgcatct ccagcctatg aaacagctgg gtctttggcc  897
88 ataagaagta aagatttgaa gacagaagga agaaactcag gagtaagctt ctagacccct  957
90 tcagcttcta cacccttctg ccctctctcc attgcctgca ccccacccca gccactcaac 1017
92 tcctgcttgt ttttcctttg gccataggaa ggtttaccag tagaatcctt gctaggttga 1077
94 tgtgggccat acattccttt aataaaccat tgtgtac                           1114

97 <210> SEQ ID NO: 2 

98 <211> LENGTH: 158 

99 <212> TYPE: PRT 

100 <213> ORGANISM: Homo sapiens 

102 <400> SEQUENCE: 2 

104 Met Ala Ser Arg Ser Met Arg Leu Leu Leu Leu Leu Ser Cys Leu Ala
105 1               5                   10                  15

108 Lys Thr Gly Val Leu Gly Asp Ile Ile Met Arg Pro Ser Cys Ala Pro
109             20                  25                  30

112 Gly Trp Phe Tyr His Lys Ser Asn Cys Tyr Gly Tyr Phe Arg Lys Leu
113         35                  40                  45

116 Arg Asn Trp Ser Asp Ala Glu Leu Glu Cys Gln Ser Tyr Gly Asn Gly
117     50                  55                  60

120 Ala His Leu Ala Ser Ile Leu Ser Leu Lys Glu Ala Ser Thr Ile Ala
121 65                  70                  75                  80

124 Glu Tyr Ile Ser Gly Tyr Gln Arg Ser Gln Pro Ile Trp Ile Gly Leu
125                 85                  90                  95

128 His Asp Pro Gln Lys Arg Gln Gln Trp Gln Trp Ile Asp Gly Ala Met
129             100                 105                 110

132 Tyr Leu Tyr Arg Ser Trp Ser Gly Lys Ser Met Gly Gly Asn Lys His
133         115                 120                 125

136 Cys Ala Glu Met Ser Ser Asn Asn Asn Phe Leu Thr Trp Ser Ser Asn
137     130                 135                 140

140 Glu Cys Asn Lys Arg Gln His Phe Leu Cys Lys Tyr Arg Pro
141 145                 150                 155

144 <210> SEQ ID NO: 3 

145 <211> LENGTH: 26 

146 <212> TYPE: DNA 

147 <213> ORGANISM: Artificial Sequence 

149 <220> FEATURE: 

150 <223> OTHER INFORMATION: Contains a BamHI restriction enzyme site. 

152 <400> SEQUENCE: 3 

153 gcaggatcct ggcttccaga agcatg 26 

156 <210> SEQ ID NO: 4 

157 <211> LENGTH: 28 

158 <212> TYPE: DNA 

159 <213> ORGANISM: Artificial Sequence 

161 <220> FEATURE: 

162 <223> OTHER INFORMATION: Contains complementary sequences to an Asp718 restriction enzyme 

163 site. 

165 <400> SEQUENCE: 4 

166 tacgggtacc ttgctctatg gtcggtac 28 

169 <210> SEQ ID NO: 5 

170 <211> LENGTH: 36 

171 <212> TYPE: DNA 

172 <213> ORGANISM: Artificial Sequence 

174 <220> FEATURE: 

175 <223> OTHER INFORMATION: Contains a BamHI restriction enzyme site followed by 6 nucleotide 

176 s resembling an efficient signal for the initiation of translatio 

177 n in eukaryotic cells. 

179 <400> SEQUENCE: 5 

180 atcgggatcc gccatcatgg cttccagaag catgcg 36 

183 <210> SEQ ID NO: 6 

184 <211> LENGTH: 28 

185 <212> TYPE: DNA 

186 <213> ORGANISM: Artificial Sequence 

188 <220> FEATURE: 

189 <223> OTHER INFORMATION: Contains complementary sequences to an Asp718 restriction enzyme 

190 site. 

192 <400> SEQUENCE: 6 

193 tacgggtacc ttgctctatg gtcggtac 28 
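
The OTHER INFORMATION entries for SEQ ID NOs 3 to 6 describe each primer by the restriction site it carries. The following is a minimal sketch, not part of the filed listing, of how those notes and the stated lengths could be checked. It assumes the standard recognition sequences GGATCC for BamHI and GGTACC for Asp718, which the listing itself does not spell out. The names SITES, PRIMERS, find_site and summarize are illustrative only.

```python
# Recognition sequences are an assumption taken from general knowledge,
# not from the listing: BamHI = GGATCC, Asp718 = GGTACC.
SITES = {"BamHI": "ggatcc", "Asp718": "ggtacc"}

# Primer sequences copied from SEQ ID NOs 3-6, with block spacing removed.
PRIMERS = {
    3: "gcaggatcctggcttccagaagcatg",
    4: "tacgggtaccttgctctatggtcggtac",
    5: "atcgggatccgccatcatggcttccagaagcatgcg",
    6: "tacgggtaccttgctctatggtcggtac",
}


def find_site(seq, site):
    """Return the 1-based position of the first match of site, or None."""
    idx = seq.lower().find(site)
    return idx + 1 if idx >= 0 else None


def summarize():
    """Map each SEQ ID NO to its length and the sites found in it."""
    report = {}
    for seq_id, seq in PRIMERS.items():
        hits = {name: find_site(seq, site) for name, site in SITES.items()}
        report[seq_id] = {
            "length": len(seq),
            "sites": {name: pos for name, pos in hits.items() if pos},
        }
    return report


if __name__ == "__main__":
    for seq_id, info in summarize().items():
        print(seq_id, info)
```

Under these assumptions, each primer should report the length given in its <211> field and exactly one site, matching its <223> note. For SEQ ID NO: 5, the six nucleotides after the BamHI site are gccatc, followed directly by the atg start codon.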